ABSTRACT

The correlation between a galaxy's morphology and its observed optical spectrum is
investigated. As an example, 4000 galaxies from the 2dF Galaxy Redshift Survey that have
both good-quality spectra and visually determined morphologies are analysed. Of particular
use is the separation of Early and Late type galaxies present in a redshift survey, since
these can then be used in their respective redshift-independent distance estimators
(D_n-σ and Tully-Fisher). It is determined that galaxies in this sample can be separated
into these two types relatively successfully by the use of various statistical methods.
These methods are briefly outlined in this paper and are also compared to the default
2dFGRS spectral classification, η. In addition, it is found that the 4000 Å break in the
spectrum of a galaxy is the best discriminant in determining its morphological type.

Key words: methods: statistical - galaxies: elliptical and lenticular, cD - galaxies: spiral 
1 INTRODUCTION

The classification of galaxies according to their observed mor- 
phologies has proved to be a very useful way of characterising 
different galaxy populations (see e.g. Hubble 1936). However, the 
high-resolution imaging data required to make such accurate mor- 
phological classifications, over a wide range of redshifts, is often 
unavailable in large galaxy redshift surveys. In this paper, a selec- 
tion of different statistical methods are investigated, with the aim of 
establishing a quantifiable link between a galaxy's morphology and 
its observed optical spectrum; thousands of which are now avail- 
able through the advent of large galaxy redshift surveys such as the 
Sloan Digital Sky Survey (York et al. 2000) and the 2dF Galaxy 
Redshift Survey (2dFGRS, Colless et al. 2001). In particular, a set 
of 4000 galaxies which have already been observed in the 2dF- 
GRS, for which the morphology and spectrum have been deter- 
mined, is used as a training set. It has long been established that 
a substantial link exists between the overall structure of a galaxy 
and the chemical properties reflected in its spectrum - which quan- 
tifies its stellar and gas composition (e.g. Morgan & Mayall 1957). 
However, efforts to quantify this relationship have been somewhat 
hampered due to the lack of large, representative, data-sets. For ex- 
ample, Folkes, Lahav & Maddox (1996) used a sample of only 26 
unique galaxy spectra and morphologies in a similar analysis to 
that presented here. This is fortunately no longer an issue with the 
advent of fibre-based galaxy redshift surveys, which are able to ac- 
quire several hundred galaxy spectra per hour. 

These surveys are producing extremely large data-sets for which the spectrum of a galaxy
is known but its detailed structural parameters generally are not, or are very difficult
to determine accurately from the photometry of the input catalogue over a representative
redshift range. Fortunately, the spectrum of a galaxy is generally a more robust quantity
to measure over a variety of redshifts. As such, if a substantive link between optical
spectra and these parameters can be determined, this will greatly enhance our ability to
probe the properties of our local galaxy population.



Another more specific advantage of being able to determine a galaxy's morphology from its
spectrum is that it allows the identification of targets for peculiar velocity follow-ups
(using either D_n-σ for elliptical galaxies or the Tully-Fisher relation for spirals).



In this paper two statistical techniques are investigated, to 
determine how accurately morphology can be estimated from a 
galaxy's optical spectrum. Both methods are 'supervised' - mean- 
ing that they require a training set of galaxies with both visually 
determined morphologies and observed spectra. The first method to 
be implemented is Fisher's linear discriminant (Section 4), which 
attempts to determine an optimal linear combination of inputs (the 
spectrum) to distinguish between several outputs (the morpholo- 
gies). The second method is an Artificial Neural Network (ANN, 
Section 5) which creates non-linear combinations of the input, and 
outputs a selection of class probabilities. The possible biases introduced into this work
by systematic effects are described in Section 6, and the success rates of each method are
then compared to those achieved using the default 2dFGRS spectral classification
parameter, η, in Section 7.
2 GALAXY MORPHOLOGIES IN THE 2DFGRS

A number of galaxies in the 2dFGRS have already had their morphologies determined manually
by direct examination of the APM Galaxy Survey images (Maddox et al. 1990a,b; see Fig. 1).
The accuracy and completeness of this sample vary a great deal depending on the source of
the classification and the range of galaxy magnitudes considered. In what follows we
primarily make use of those galaxies which were ascribed morphologies in the APM Bright
Galaxy Catalogue (Loveday 1996). This catalogue provides a complete sample of classified
galaxies down to a magnitude limit of b_J = 16.44. The exact value of this magnitude limit
now varies across the sky due to recent re-calibrations of the APM magnitudes (see e.g.
Colless et al. 2001 for details of the most recent calibrations of those galaxies included
in the 2dFGRS); however, this will not have any substantial impact upon the
representativeness of our classified sample. A significant number of galaxies at fainter
magnitudes also have morphologies determined from other sources, but these are
substantially less reliable and so are not used in this analysis.

Of those b_J < 16.5 galaxies which have been successfully observed so far in the 2dFGRS,
3899 have a morphological classification (Fig. 2). The galaxy morphologies are given in
four broad bins: Elliptical, S0, Spiral and Irregular. However, in the analysis presented
here they are rebinned into only two classes: Early (Elliptical, S0) and Late (Spiral,
Irregular) types. The reasons for doing so are two-fold. Firstly, the number of classified
galaxies which have been identified as S0 or Irregular is significantly smaller than the
number of Spirals. The identification of these types will therefore be greatly hindered by
the presence of Spiral outliers, which effectively swamp any identifying signal that may
arise from these types. Secondly, the distinction between Early and Late type galaxies is
of fundamental importance to observational cosmology, since each can be used in its own
redshift-independent distance estimator, i.e. D_n-σ for Early types (Dressler et al. 1987)
and the Tully-Fisher relation for Late types (Tully & Fisher 1977).

Note that this sample of galaxies consists entirely of relatively 
bright, nearby galaxies and so may not be representative of the en- 
tire 2dFGRS galaxy population. Another important point to bear in 
mind is that as these galaxies are relatively extended on the sky, 
the spectra observed of them (through the fixed fibre aperture of 
the 2dF instrument) may not be representative of the entire galaxy. 
This so called 'aperture effect' is a difficult issue to address and has 
led to much discussion in the literature (see e.g. Kochanek, Pahre & 
Falco 2000; Madgwick et al. 2002). The possible impact of aperture effects on our results
is discussed further in Section 6.



3 GALAXY SPECTRA IN THE 2DFGRS 

The purpose of this analysis is to relate the spectrum of a galaxy to its morphology. In
the case of the 2dFGRS each spectrum consists of 1024 channels spanning the wavelength
range of approximately 3700-8000 Å, thereby including all the major optical diagnostics
between [O II] and Hα (see Folkes et al. 1999, for further details). As an illustration,
the average 2dFGRS spectrum for a representative volume-limited sample is shown in Fig. 3.

Rather than dealing with all 1024 spectral channels in the subsequent analysis (to
represent a given spectrum), it is possible to take advantage of the fact that the vast
majority of these channels are redundant by means of some form of data compression. In the
analysis presented here use is made of a Principal Component Analysis (PCA, see e.g.
Murtagh & Heck 1987), since this compression algorithm has met with considerable success
in dealing with galaxy spectra (e.g. Connolly et al. 1995; Galaz & de Lapparent 1998;
Folkes et al. 1999; Madgwick et al. 2002).

Figure 2. The distribution of redshifts of those galaxies classified as Early or Late in
the 2dFGRS. Note that in order to ensure that the classification is relatively robust we
only include those galaxies within the magnitude limit b_J < 16.5.

3.1 Review of PCA 

PCA is a well established statistical technique which has proved 
very useful in dealing with high dimensional data sets. In the partic- 
ular case of galaxy spectra we are typically presented with approx- 
imately 1000 spectral channels per galaxy, however when used in 
applications this is usually compressed down to just a few numbers, 
either by integrating over small line features - yielding equivalent 
widths - or over wide colour filters. The key advantage of using 
PCA in our data compression is that it allows us to make use of all 
the information contained in the spectrum of a galaxy in a statisti- 
cally unbiased way, i.e. without the use of such ad hoc filters. 

In order to perform the PCA on our galaxy spectra we first construct a representative
volume-limited sample of the galaxies. When we apply the PCA to this sample it constructs
an orthogonal set of components (eigenspectra, herein denoted PC1, PC2, etc.) which span
the wavelength space occupied by the galaxy spectra. These components are chosen by the
PCA in such a way that as much information (variance) as possible is contained in the
first eigenspectrum, and that the amount of the remaining information in each subsequent
eigenspectrum is likewise maximised. Therefore, if the information contained in the first
n eigenspectra is found to be significantly greater than that in the remaining
eigenspectra, we can significantly compress the data set by replacing each galaxy spectrum
(described by 1000 channels) with just those first n projections (denoted pc1, pc2, etc.).
The variances corresponding to the first 10 principal components derived in this manner
are shown in Table 1.
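The compression step described above can be sketched as follows. This is a minimal
illustration only, assuming the spectra are stored as rows of a NumPy array and that the
PCA is computed from a singular value decomposition of the mean-subtracted data; the
function names, the synthetic spectra and any normalisation choices are hypothetical and
are not taken from the 2dFGRS pipeline.

```python
import numpy as np

def pca_eigenspectra(spectra, n_components):
    """Return the mean spectrum, the leading eigenspectra and their variance fractions.

    spectra has shape (n_galaxies, n_channels).
    """
    mean_spectrum = spectra.mean(axis=0)
    centred = spectra - mean_spectrum
    # Rows of vt are orthonormal eigenspectra, ordered by decreasing variance.
    _, singular_values, vt = np.linalg.svd(centred, full_matrices=False)
    variance = singular_values ** 2
    variance_fraction = variance / variance.sum()
    return mean_spectrum, vt[:n_components], variance_fraction[:n_components]

def project(spectra, mean_spectrum, eigenspectra):
    """Compress each spectrum to its projections (pc1, pc2, ...) onto the eigenspectra."""
    return (spectra - mean_spectrum) @ eigenspectra.T

# Synthetic stand-in data (an assumption): two smooth templates plus a little noise.
rng = np.random.default_rng(0)
n_gal, n_chan = 200, 64
templates = np.vstack([np.sin(np.linspace(0, 3, n_chan)),
                       np.cos(np.linspace(0, 5, n_chan))])
coeffs = rng.normal(size=(n_gal, 2)) * np.array([3.0, 1.0])
spectra = coeffs @ templates + 0.05 * rng.normal(size=(n_gal, n_chan))

mean_spectrum, eigenspectra, var_frac = pca_eigenspectra(spectra, 9)
pcs = project(spectra, mean_spectrum, eigenspectra)  # shape (n_gal, 9)
```

Here each row of `pcs` plays the role of the projections used in place of the full
spectrum, and `var_frac` is the analogue of the variance percentages quoted for the
2dFGRS eigenspectra.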



Correlating galaxy morphologies and spectra in the 2dFGRS 3 



Figure 1. A selection of example galaxy spectra and images are shown. The spectra have
been taken from the 2dF Galaxy Redshift Survey and are in units of counts/bin with
arbitrary normalisation, plotted against Wavelength (Å). The images shown are 1×1 arcmin
postage stamps, in the standard astronomical orientation, taken from the SuperCOSMOS Sky
Survey (Hambly et al. 2001). For each spectrum the 2dFGRS object name is given and the
labels (L) and (E) refer to Late-type and Early-type morphologies respectively.



Table 1. Variance contained in the first 10 principal components of the 2dFGRS galaxy
spectra.

Component   Variance (%)   Component   Variance (%)
1           54             6           0.99
2           15             7           0.86
3           4.0            8           0.57
4           2.7            9           0.41
5           1.2            10          0.27



Note that the PCA is merely a statistical tool: we do not imply (yet) that any of these
components are physically significant, but rather we are merely using them as a method of
data compression.

During the PCA it was found that the eigenspectra became dominated by unphysical broad
features from the fifth eigenspectrum (PC5) onwards. This was due to artifacts from sky
emission features which we were unable to completely remove during the spectral reduction.
Rather than restrict our subsequent analysis to only the first four principal components,
we have instead repeated the analysis with the wavelength range 5850-6200 Å masked out.
This has no noticeable effect on the first four eigenspectra, all of which are essentially
identical between the two PCA implementations. However, it does allow us to probe deeper
into the PCA space than we would otherwise have been able to (up to PC9).

It is worth noting that most galaxy spectra can be accurately reconstructed using only the
first 4-5 principal components. In our subsequent analysis we make use of the first nine
(this choice is justified in detail by Folkes, Lahav & Maddox 1996).

In order to gain some initial understanding of the link between galaxy spectra and
morphologies, histograms are shown in Fig. 4 of the distributions of principal component
projections for both Early and Late type galaxies. Also plotted are the eigenspectra for
each of these projections, which illustrate what spectral features are encoded in the
respective projections (Fig. 5). Included in each of these plots are the corresponding
results for the η = 0.5 pc1 − pc2 projection which is used to spectrally classify the
2dFGRS galaxies (see next Section). It can be seen that most of the first nine principal
components encode some degree of information about the morphology of a galaxy, although
none are capable of separating Early from Late type galaxies accurately (with the
exception of the η spectral classification). The aim of the analysis presented in this
paper is essentially to determine if it is possible to improve the separation between
Early and Late types by taking either linear (Fisher analysis, Section 4) or non-linear
(Artificial Neural Networks, Section 5) combinations of these projections.



3.2 Spectral classification in the 2dFGRS 

The 2dF instrument (Lewis et al. 2002) was designed to measure large numbers of redshifts
in as short an observing time as possible. However, in order to optimise the number of
redshifts that can be measured in a given period of time, compromises have had to be made
with respect to the spectral quality of the observations. Therefore, if one wishes to
characterise the observed galaxy population in terms of its spectral properties, care must
be taken to ensure that these properties are robust to the instrumental uncertainties.

Figure 3. The average spectrum of an M_bJ − 5 log10(h) > −18 volume-limited sample of
galaxies drawn from the 2dF Galaxy Redshift Survey. The main spectral features present are
labelled.

The quality and representativeness of the observed spectra can 
be compromised in several ways and a full list is presented in pre- 
vious work (see e.g. Madgwick et al. 2002). The net effect is that 
the uncertainties introduced into the fibre-spectra predominantly af- 
fect the calibration of the continuum slope and have relatively little 
impact on smaller-scale features such as the emission/absorption 
line strengths. For this reason any given galaxy spectrum which is projected into the
plane defined by (pc1, pc2) will not be uniquely defined in the direction of varying
continuum but will be robust in the orthogonal direction (which measures the average line
strength, see Fig. 6).

The projection onto this robust axis is denoted by η (ETA_TYPE in the 2dFGRS catalogue),

$$\eta = a \cdot pc_1 - pc_2 , \qquad (1)$$

where a is a constant which we find empirically to be a = 0.5 ± 0.1. This (continuous)
variable η, being the single most important component of the galaxy spectra which was
robust to instrumental uncertainties, was chosen as the measure of spectral type in the
2dFGRS.

Note that although η is a continuous measure of type, it is often useful to divide a
galaxy sample into different bins to simplify subsequent analyses. One of the most common
divisions to make is to separate those galaxies with η < −1.4 from those with η ≥ −1.4,
referred to as relatively quiescent and star-forming galaxies respectively.
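As a small worked illustration of this classification, the sketch below evaluates η from
a pair of projections and applies the −1.4 division. The function names and the example
projection values are hypothetical; only a = 0.5 and the cut at −1.4 are taken from the
text.

```python
import numpy as np

A = 0.5         # empirical constant quoted in the text (0.5 +/- 0.1)
ETA_CUT = -1.4  # division between quiescent and star-forming galaxies

def eta(pc1, pc2, a=A):
    """Spectral type eta = a * pc1 - pc2 for arrays of projections."""
    return a * np.asarray(pc1) - np.asarray(pc2)

def is_star_forming(eta_values, cut=ETA_CUT):
    """True where eta >= cut (star-forming), False where eta < cut (quiescent)."""
    return np.asarray(eta_values) >= cut

# Made-up projections for three galaxies, purely to exercise the arithmetic.
pc1 = np.array([-4.0, 0.0, 6.0])
pc2 = np.array([0.5, 1.0, -2.0])
eta_values = eta(pc1, pc2)             # -2.5, -1.0, 5.0
flags = is_star_forming(eta_values)    # False, True, True
```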



4 FISHER'S LINEAR DISCRIMINANT 

It is clear from the trends in Fig. 4 that morphology is indeed related to the spectra of
galaxies although, with the exception of the η spectral classification, the correspondence
is not particularly pronounced. However, as shown by η, it may be possible to improve this
situation by taking various linear combinations of the projections in order to isolate
those particular parts of a galaxy's spectrum which contain the most information regarding
its morphology.

In this section a method first proposed by Fisher (see e.g. 
Bishop 1995 and references therein) is considered, for linearly re- 
ducing data dimensionality (in this case we hope to go from our 9 
principal components to 1 morphology) in a way which will op- 
timally distinguish between different classes of objects. Strictly 
speaking this method will not create a discriminant between the 
two classes but rather, once we have reduced the dimensionality, it 
should become clear how to divide the sample. 

As mentioned before, this method is a linear approach to this 
problem. In the next section this will be generalised to consider the 
possibility of a non-linear relationship between morphology and 
spectra through the use of Artificial Neural Networks. 



4.1 Mathematical formulation 

The mathematical formulation of Fisher's linear discriminant is straightforward. We
consider a set of input vectors x; we wish to determine the weights w, such that the
projection y is the most discriminatory between our two classes,

$$y = \mathbf{w}^{T}\mathbf{x} . \qquad (2)$$

The mean vectors m_1 and m_2 of each class are given by,

$$\mathbf{m}_k = \frac{1}{N_k}\sum_{n \in C_k} \mathbf{x}^{n} . \qquad (3)$$

Here n ∈ C_k are the N_k elements of class C_k (in this case k = {1, 2} for Early and Late
types respectively). After projection, the scalar separation between the means,
$m_k = \mathbf{w}^{T}\mathbf{m}_k$, will be,

$$m_2 - m_1 = \mathbf{w}^{T}(\mathbf{m}_2 - \mathbf{m}_1) , \qquad (4)$$

and the within-class covariance is,

$$s_k^2 = \sum_{n \in C_k} (y^{n} - m_k)^2 . \qquad (5)$$

Figure 4. The distribution of η for E/S0 (dotted line) and Spiral galaxies (solid line) is
shown in the top left panel. The other panels show these distributions for the first 9
principal components.

Figure 6. The distribution of (pc1, pc2) projections of the observed 2dF galaxies is
shown. Also shown are the projections which maximise either the continuum or
emission/absorption component in each spectrum. The latter of these projections is used to
classify the 2dFGRS galaxies.



Fisher's assertion was that the optimal mapping to be used in reducing our dimensionality
should be such as to maximise,

$$J(\mathbf{w}) = \frac{(m_2 - m_1)^2}{s_1^2 + s_2^2} . \qquad (6)$$



This is easy to interpret physically: we wish to find a set of weights w such as to
maximise the mean separation between the two classes (m_2 − m_1) after we have performed
our projection. The variance in the denominator takes into account the fact that our
initial vectors x may have different spreads and hence differing degrees of overlap in
different directions. Clearly we wish to find a projection which minimises this overlap
between our two classes.

To determine the optimal weights we simply need to re-introduce the explicit weight
dependence into Eqn. 6 and differentiate. It is shown in Bishop (1995) that the solution
is then,

$$\mathbf{w} \propto \mathbf{S}_W^{-1}(\mathbf{m}_2 - \mathbf{m}_1) , \qquad (7)$$

where,

$$\mathbf{S}_W = \sum_{k}\sum_{n \in C_k} (\mathbf{x}^{n} - \mathbf{m}_k)(\mathbf{x}^{n} - \mathbf{m}_k)^{T} \qquad (8)$$

is the total within-class covariance matrix. Equation 7 only yields the direction of the
weight vector, not its magnitude. Conventionally the magnitude of w is taken to be such
that $\sum_i w_i^2 = 1$.

Figure 5. The first 9 principal components. Also shown is the spectrum of the η component
(top panel), used to classify the 2dFGRS.
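The Fisher solution quoted from Bishop (1995) can be sketched numerically as below. This
is only an illustrative implementation under stated assumptions: the two classes are
passed as NumPy arrays with one object per row, the synthetic two-dimensional data stand
in for the nine principal component projections, and the function names are hypothetical.

```python
import numpy as np

def fisher_weights(x_early, x_late):
    """Unit-norm weights proportional to S_W^{-1} (m_2 - m_1)."""
    m1 = x_early.mean(axis=0)
    m2 = x_late.mean(axis=0)
    d1 = x_early - m1
    d2 = x_late - m2
    s_w = d1.T @ d1 + d2.T @ d2        # total within-class covariance matrix
    w = np.linalg.solve(s_w, m2 - m1)  # direction of the weight vector
    return w / np.linalg.norm(w)       # normalise so that sum_i w_i^2 = 1

def fisher_criterion(w, x_early, x_late):
    """J(w): squared separation of projected means over the summed within-class scatter."""
    y1, y2 = x_early @ w, x_late @ w
    s1 = ((y1 - y1.mean()) ** 2).sum()
    s2 = ((y2 - y2.mean()) ** 2).sum()
    return (y2.mean() - y1.mean()) ** 2 / (s1 + s2)

# Synthetic, strongly correlated two-class data (an assumption for illustration).
rng = np.random.default_rng(1)
cov = np.array([[1.0, 0.8], [0.8, 1.0]])
x_early = rng.multivariate_normal([0.0, 0.0], cov, size=500)
x_late = rng.multivariate_normal([1.0, 0.0], cov, size=500)

w = fisher_weights(x_early, x_late)
j_fisher = fisher_criterion(w, x_early, x_late)
# Projecting along the raw difference of means ignores the covariance and scores lower.
j_naive = fisher_criterion(np.array([1.0, 0.0]), x_early, x_late)
```

Because the classes overlap differently in different directions, the Fisher direction is
tilted away from the simple difference of the class means, which is the effect the
denominator of the criterion is designed to capture.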



4.2 Classification using the Fisher analysis 

After performing the Fisher analysis on the complete set of mor- 
phologically classified 2dFGRS galaxies, the resulting weighting 
vector is found to be, 

$$\mathbf{w} = (1.2, -1.3, 3.5, 0.18, 0.34, -0.96, -0.68, -6.5, 6.4)/10 . \qquad (9)$$



Figure 7 shows the resulting Fisher projections using these weights for both the known
Early and Late type galaxies in the 2dFGRS. It is clear from this figure that the method
has been relatively successful, in that the distinction between Early and Late type
galaxies is now much more pronounced than for any of the individual principal components
(Fig. 4).

Having derived this projection it is now necessary to determine 'by hand' a cut that we
will use to distinguish Early from Late type galaxies in the future. The actual cut
adopted will, of course, depend on the specific application for which one requires the
galaxies to be classified. For example, if one requires a complete sample of Early type
galaxies a relatively high cut should be adopted compared to if one requires a relatively
'pure' sample. We note that in the context of deriving a pure sample, the results appear
to be somewhat disappointing in that the Fisher analysis does not appear to be as
discriminatory between Early and Late type galaxies as would otherwise have been hoped.

Figure 7. Fisher's linear discriminant, as calculated from the first nine principal
components with the aim of discerning the galaxy morphology. It is clear that a
significant degree of overlap between the morphological types still exists.

Figure 8. The pc1 and pc2 projections of the morphologically classified 2dFGRS galaxies
are shown, in separate panels for E/S0 and Spirals. Here the morphologies have been
derived using Fisher's method with 9 principal components. It is remarkable that all the
information in these 9 components is essentially contained in these first two projections.
The Fisher discriminant has been cut at −0.55.



From Fig. 7 we conclude that a good general distinction between Early and Late type
galaxies can be achieved by cutting the distribution at −0.55. Using this cut gives the
total number of galaxies classed Early and Late as (814, 454) and (475, 2156)
respectively, where (x1, x2) represents x1 galaxies that are genuinely Early type and x2
that are genuinely Late type. Note that the contamination between types will vary
substantially depending on the selection criteria of the redshift survey, and so we are
primarily interested in the success rates of the classification, i.e. 63% of Early types
and 82% of Late types successfully classified, as opposed to the degree of contamination.

The distribution shown in Fig. 7 is very similar to that shown in Fig. 4 for η. Indeed,
Fig. 8 shows the (pc1, pc2) projections for our sample of galaxies separately, depending
on whether they satisfied the Fisher −0.55 criterion for being Early or Late type.
Overplotted on this figure is the cut imposed to classify the 2dFGRS into Type 1
(Quiescent) galaxies using η (see Section 3.2). The correspondence is remarkable. Clearly
much of the physical information carried in the PCA eigenspectra is contained in the first
two principal components. It is also interesting that repeating the Fisher analysis using
only these first two principal components yields a classification parameter very similar
to η (with very similar success of classification). These two forms of classification
(Fisher and η) are contrasted further in Section 7.

It is worth noting that most spectral classifications are in fact indirectly anchored to
morphological types by use of a training set (e.g. the Kennicutt Atlas, Kennicutt 1992) of
galaxy spectra for which the morphology is known (see e.g. Connolly et al. 1995; Folkes et
al. 1999). However, in the case of the 2dFGRS, η was chosen purely on the grounds of
robustness with respect to instrumental uncertainties in determining the spectral
continuum. This point is discussed further later in the paper.
4.3 Spectral features

If we combine the PCA eigenspectra (Fig. 5) using our derived weighting vector w we can
determine which spectral features are being used by the Fisher criterion to measure galaxy
morphology. This spectrum is shown in Fig. 9.

It must be borne in mind, when attempting to interpret this spectrum, that the PCA is
mean-subtracted, i.e. the average 2dFGRS spectrum has been subtracted from each individual
spectrum before calculating its projection (see Fig. 3). Therefore negative values in this
eigenspectrum represent either absorption or below average emission, whereas positive
values correspond to above average emission.

Perhaps the most striking feature of the spectrum shown in Fig. 9 is its overall lack of
nebular emission features, despite the fact that these dominate most of the PCA
eigenspectra (see Fig. 5). It is interesting that this spectrum, which is so clearly
different to that used to calculate η (see top panel of Fig. 5), should give such a
similar classification. Clearly there is a very pronounced correlation between the nebular
emission features used to calculate η, and the strength of the 4000 Å break and (to a
lesser degree) the Hα emission line, as measured by this combined eigenspectrum. The
apparent broad feature at 5200 Å (corresponding to the Mg b absorption line) is an
artifact introduced by truncating the series of eigenspectra just as this feature was
becoming prominent in them (it can be clearly seen in PC8 and PC9 in Fig. 5), and as such
it is not contributing to the success of the classification itself.

In fact it is possible to determine more accurately exactly what parts of this spectrum
are being used by the Fisher criterion to determine morphology by 'masking out' certain
segments of the combined eigenspectrum of Fig. 9, and then seeing how accurately one can
recover the same classification (after re-projecting the galaxy spectra onto it). By doing
this it can be shown that the entire classification is based on the part of the
eigenspectrum below 4800 Å, and hence that the morphology of a galaxy can be most
accurately determined from the size of the 4000 Å break, without the use of any further
spectral information (although using the entire spectrum does not degrade the result, and
hence it will continue to be used throughout this work).
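The masking test described in this section can be sketched as follows. The weights are
those quoted for the Fisher analysis, and the wavelength grid follows the approximate
3700-8000 Å range of the spectra; everything else (the random orthonormal "eigenspectra",
the random mean-subtracted spectra and the function names) is a hypothetical stand-in, so
the example demonstrates only the mechanics and not the published result.

```python
import numpy as np

# Fisher weights as quoted in the text for the nine principal components.
W_FISHER = np.array([1.2, -1.3, 3.5, 0.18, 0.34, -0.96, -0.68, -6.5, 6.4]) / 10.0

def combined_eigenspectrum(weights, eigenspectra):
    """Weighted sum of eigenspectra; eigenspectra has shape (n_components, n_channels)."""
    return weights @ eigenspectra

def masked_projection(residual_spectra, combined, keep):
    """Project mean-subtracted spectra onto the combined eigenspectrum,
    using only the channels where keep is True."""
    return residual_spectra @ np.where(keep, combined, 0.0)

rng = np.random.default_rng(3)
wavelengths = np.linspace(3700.0, 8000.0, 1024)

# Stand-in orthonormal eigenspectra and mean-subtracted spectra (assumptions).
q, _ = np.linalg.qr(rng.normal(size=(wavelengths.size, 9)))
eigenspectra = q.T
residual_spectra = rng.normal(size=(50, wavelengths.size))
pcs = residual_spectra @ eigenspectra.T

combined = combined_eigenspectrum(W_FISHER, eigenspectra)
all_channels = np.ones_like(wavelengths, dtype=bool)
y_full = masked_projection(residual_spectra, combined, all_channels)
y_blue = masked_projection(residual_spectra, combined, wavelengths < 4800.0)
```

With no channels masked, `y_full` is identical to the Fisher projection computed from the
nine projections, `pcs @ W_FISHER`. Comparing classifications from `y_blue` and `y_full`
on real spectra is the kind of check the text describes.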



5 ARTIFICIAL NEURAL NETWORKS 

This section further refines the analysis of the previous section by 
considering the possibility of a nonlinear relationship between a 
galaxy's spectrum and its morphology. This is done by making use 
of an Artificial Neural Network, which is trained to identify galaxy 
morphology after being inputted with the galaxy spectra (from the 
first nine PCA eigenspectra). 



5.1 Brief description 

An Artificial Neural Network (ANN) is a mathematical construct
originally designed to simulate the functioning of the brain (see 
e.g. Ripley 1996). The network itself is designed around a set of 
layers, consisting of input nodes, hidden nodes and output nodes. 
The connections between these nodes can be quite complex and in 
most instances all the nodes in a previous layer are connected to all 
the nodes in the next layer. Because of this complexity the ANN 
can train itself to recognise highly nonlinear relationships between 
the inputs and the outputs we desire. 



Each connection between two nodes has an associated weight, u_ij. The total input to any
given node is the sum of all the individual inputs weighted by u_ij. Each node takes this
total input, z, and applies a transfer function to it, before passing it on to the next
node. The transfer function is generally taken to be a sigmoid function,

$$f(z) = \frac{1}{1 + \exp(-z)} , \qquad (10)$$

since this keeps the output in the range [0,1], hence limiting the variation in the
possible weighting schemes.

In order to train the network we need to establish a cost function. This is usually taken
to be the Euclidean separation between the desired outputs, T_i, we hope to receive (at
each of our output nodes), and the actual outputs, F(u, x_i),

$$E = \frac{1}{2}\sum_{i=1}^{N}\left[T_i - F(\mathbf{u}, \mathbf{x}_i)\right]^2 . \qquad (11)$$



All the weights in our network are initially set to random values. The network is then
trained on our training data-set by inputting the first nine principal components and
comparing the output to the desired morphology of each object. The weights are adjusted by
means of back-propagation until a global minimum in the cost function is found. The
network is then fully trained and can be run on the testing set to establish its success.

Another aspect of neural networks is that of weight decay, 
whereby one can specify an additional factor in the cost function 
in order to restrain the magnitude of the weights. We do not make 
use of this refinement for the following reason: If one specifies mul- 
tiple outputs for the neural network (in our case two outputs, corre- 
sponding to Early and Late) these can in fact be interpreted as the 
probabilities that a galaxy belongs to either class - so long as the 
weight decay is set to 0. 

Further details about the mathematical formulation and interpretation of ANNs are given
in e.g. Lahav et al. (1996) and Bishop (1995).
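To make the description above concrete, the following sketch evaluates a small
feed-forward network with sigmoid transfer functions and a sum-of-squares cost. It is a
minimal illustration rather than the network code used in this work: bias terms and the
back-propagation training loop are omitted, the 9:5:2 layer sizes are used only as an
example, and the function names and random inputs are hypothetical.

```python
import numpy as np

def sigmoid(z):
    """Sigmoid transfer function, mapping any total input to the range [0, 1]."""
    return 1.0 / (1.0 + np.exp(-z))

def init_weights(layer_sizes, rng):
    """One random weight matrix per pair of adjacent layers (no biases in this sketch)."""
    return [rng.normal(scale=0.5, size=(n_in, n_out))
            for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:])]

def forward(weights, x):
    """Propagate inputs through fully connected layers, applying the sigmoid at each node."""
    activation = x
    for u in weights:
        activation = sigmoid(activation @ u)
    return activation

def cost(weights, x, targets):
    """Half the summed squared difference between desired and actual outputs."""
    return 0.5 * np.sum((targets - forward(weights, x)) ** 2)

rng = np.random.default_rng(4)
weights = init_weights((9, 5, 2), rng)       # 9 inputs, 5 hidden nodes, 2 outputs
x = rng.normal(size=(4, 9))                  # four objects, nine projections each
targets = np.array([[1, 0], [0, 1], [1, 0], [0, 1]], dtype=float)  # (Early, Late)

outputs = forward(weights, x)
initial_cost = cost(weights, x, targets)
```

Training would then adjust `weights` to reduce `cost`, for example by gradient descent
using back-propagated derivatives.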



5.2 Results from the ANN 

The ANN was run using three different configurations, all of which 
had 9 inputs and 2 outputs, and as such were only distinguished 
by the structure and size of their hidden layers. The first network 
used had a single hidden layer of 5 nodes (configuration 9:5:2), 
the second increased this to 9 nodes (configuration 9:9:2), and the 
final ANN had two hidden layers with 5 nodes each (configuration 
9:5:5:2). 

In each case the network required both a 'training' and a 'testing' data set. Because
there were so many more Late than Early type galaxies in our sample (2631 versus 1268
respectively), 800 of each type were randomly selected for the purposes of training. Equal
numbers of each type were used for training because ANNs are essentially complicated
Bayesian classifiers; as such, the detection of each type will be biased by the prior
distributional information of that type, which is determined by the selection criteria of
the redshift survey. By using equal numbers of each type of galaxy in our training set we
ensure that the results presented here are general and not only applicable to b_J-selected
galaxy samples, which are biased towards recently star-forming galaxies.
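The balanced selection described above can be sketched as follows. The class sizes (1268
Early, 2631 Late) and the 800 training objects per class are taken from the text; the
label encoding, the random seed and the function name are assumptions made for the
example.

```python
import numpy as np

def balanced_split(labels, n_per_class, rng):
    """Pick n_per_class training objects from each class; the rest form the test set."""
    labels = np.asarray(labels)
    chosen = [rng.choice(np.flatnonzero(labels == cls), size=n_per_class, replace=False)
              for cls in np.unique(labels)]
    train_idx = np.sort(np.concatenate(chosen))
    is_test = np.ones(labels.size, dtype=bool)
    is_test[train_idx] = False
    return train_idx, np.flatnonzero(is_test)

# Hypothetical encoding: 0 = Early, 1 = Late.
labels = np.array([0] * 1268 + [1] * 2631)
train_idx, test_idx = balanced_split(labels, 800, np.random.default_rng(2))
```

With these numbers the training set holds 1600 galaxies and the remaining 2299 are left
for testing and validation.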

The remaining 2299 galaxies were used as a testing and vali- 
dating set. The purpose of validation is to ensure that the ANN does 
not 'over-train' on the training set by over-fitting the data which 
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Figure 9. The combined 2dFGRS eigenspectrum corresponding to the Fisher criterion derived for determining galaxy morphology. It can be shown that the 
Fisher method is almost exclusively quantifying the magnitude of the 4000A break, and that the entire classification can be created using only the part of 
this spectrum at wavelengths below 4800A. The broad feature at ^5200A (corresponding to Mgb) is erroneous, and has resulted from the fact that we have 
truncated the series of eigenspectra just as this was feature was becoming prominent. 



it is supplied. For this reason the minimum in the error function
(Eqn. 11) is calculated from the testing data set, rather than the
training set. Note that for our purposes a galaxy was classified as
Early type if the output probability from the Early type node of
the ANN was greater than 0.5, and likewise for Late types.
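
The stopping rule and the 0.5 threshold can be illustrated with a short sketch. The error histories below are invented purely for illustration, the helper names `best_epoch` and `classify` are hypothetical, and treating an output of exactly 0.5 as Late is an arbitrary choice made here, not something specified in the text.

```python
def best_epoch(test_errors):
    """Index of the training epoch with the lowest testing-set error."""
    return min(range(len(test_errors)), key=test_errors.__getitem__)


def classify(p_early, threshold=0.5):
    """Label a galaxy from the output of the Early type node."""
    return "Early" if p_early > threshold else "Late"


# Hypothetical error histories: the training error keeps falling,
# while the testing error turns over once the network over-fits.
train_err = [0.60, 0.45, 0.36, 0.30, 0.26, 0.23, 0.21]
test_err = [0.62, 0.48, 0.40, 0.37, 0.38, 0.41, 0.45]

stop = best_epoch(test_err)
print(stop)  # 3
print([classify(p) for p in (0.91, 0.50, 0.12)])  # ['Early', 'Late', 'Late']
```

Choosing the epoch from the testing error rather than the training error is what prevents the weights from being tuned to noise in the 1600 training spectra.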

In general it was found that the results were relatively insensi- 
tive to the choice of network architecture (changing by only a few 
percentage points of successfully detected galaxies of each type). 
This is consistent with the results from the previous section where 
it was found that the PCA eigenspectra generally carry the same 
physical information as each other - despite being 'statistically' in- 
dependent. 

The results from each ANN are summarised in Table 2, and
once again the (pc1, pc2) components are shown for the galaxies
which have been classified by the ANN (in this case the 9:9:2
configuration) in Fig. 10. Again it can be seen that the classification
can be quite accurately expressed using only these first two
projections. The correspondence with the 2dFGRS η classification is
also shown.

One problem that arose when attempting to separate the Early
type galaxies in our sample was that we obtained a significant
contamination from Late type galaxies (~50%). It was found that this
fractional contamination could be reduced somewhat by increasing
the complexity of the ANN; however, this resulted in a lower
percentage of the actual Early type galaxies being correctly
classified. The actual degree of this contamination will vary according
to the galaxy population under consideration. In the case of the
2dFGRS we are presented with a b_J-selected sample of galaxies and
as such there is a significantly higher proportion of Late type (star- 



Table 2. Success rates of the ANNs. The numbers given are percentages of
galaxies correctly classified for each morphological type.

Network   Configuration   Early   Late
1         9:5:2           67%     80%
2         9:9:2           70%     82%
3         9:5:5:2         67%     83%


forming) galaxies than Early types. On the other hand, in the case 
of a 2MASS (Jarrett et al. 2000) near-infrared selected galaxy sam- 
ple, the proportion of types is reversed (Madgwick & Lahav 2001) 
and so a purer sample of Early types can be obtained.



6 APERTURE EFFECTS 

Because our sample of morphologically classified galaxies is drawn 
from only the most nearby (and hence the most extended on the 
sky) galaxies, we must consider the possibility that the observed 
spectra are not representative of the galaxies as a whole. For ex- 
ample, the 2dF fibre (diameter 2-2.16") may only sample the light 
from the bulge of a nearby spiral galaxy - the spectrum from which 
tends to be more similar to that of an Early type galaxy. 

In order to test the importance of redshift upon the success of 
our morphological classifier we return to our ANN (configuration 
9:9:2), trained previously. If aperture effects are important then we 
would expect to see that the ANN will recover the galaxy morphol- 
ogy of relatively distant galaxies more accurately than for nearby 

Figure 10. The pc1 and pc2 projections of the morphologically classified 2dFGRS galaxies are shown. Here the morphologies have been derived using
an Artificial Neural Network with 9 principal components as inputs (9:9:2). Again we can see that all the information in these 9 components is essentially
contained in just these first two projections.



galaxies, particularly in the case of Late type galaxies. For this rea- 
son the results from the previous section were re-determined after 
dividing the testing sample into two redshift bins, z < 0.05 (1533 
galaxies) and z > 0.05 (761 galaxies). 
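
A minimal sketch of this re-determination is given below. The record layout (redshift, true type, predicted type), the function name `success_by_redshift` and the six example galaxies are hypothetical; only the z = 0.05 dividing line comes from the text, and assigning a galaxy at exactly z = 0.05 to the upper bin is an assumption made here.

```python
def success_by_redshift(galaxies, z_split=0.05):
    """Percentage of correctly classified galaxies per (bin, type).

    galaxies: iterable of (z, true_type, predicted_type) tuples.
    """
    counts = {}
    for z, true_type, predicted in galaxies:
        bin_name = "low" if z < z_split else "high"
        key = (bin_name, true_type)
        n_ok, n_all = counts.get(key, (0, 0))
        counts[key] = (n_ok + (predicted == true_type), n_all + 1)
    return {key: 100.0 * ok / n for key, (ok, n) in counts.items()}


# Invented records, for illustration only.
sample = [
    (0.02, "Early", "Early"),
    (0.03, "Early", "Late"),
    (0.04, "Late", "Late"),
    (0.07, "Late", "Late"),
    (0.08, "Late", "Early"),
    (0.09, "Early", "Early"),
]
rates = success_by_redshift(sample)
for key in sorted(rates):
    print(key, rates[key])
```

If aperture effects dominated, the Late type success rate in the "high" bin would be expected to exceed that in the "low" bin.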

The results are summarised in Table 3. Contrary to our
expectations the success of the classification is relatively immune to
the redshift being sampled. Clearly the situation must be more complex
than a simple analysis such as this can resolve.

One important aspect of the spectra being used in this analysis
which may explain this situation is the substantial seeing present
at the Anglo-Australian Telescope site. This seeing acts to 'smooth
out' any spectral gradients which may be present in a given galaxy.
In general the seeing is of the order of 1.8-2.5" and so can
effectively double the area being sampled by the 2dF fibre aperture.
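
One rough way to see where a factor of about two comes from is to assume that the fibre diameter and the seeing disc add in quadrature to give an effective sampled diameter. This is a simplifying assumption introduced here for illustration, not a model taken from the paper, and the function name is hypothetical.

```python
import math


def effective_area_ratio(fibre_diameter, seeing_fwhm):
    """Ratio of effective to geometric sampled area.

    Assumes (crudely) that the fibre diameter and the seeing FWHM,
    both in arcsec, combine in quadrature.
    """
    effective = math.hypot(fibre_diameter, seeing_fwhm)
    return (effective / fibre_diameter) ** 2


# Ranges quoted in the text: fibre 2-2.16", seeing 1.8-2.5".
for fibre in (2.0, 2.16):
    for seeing in (1.8, 2.5):
        ratio = effective_area_ratio(fibre, seeing)
        print(f"fibre {fibre}\"  seeing {seeing}\"  area ratio {ratio:.2f}")
```

Under this assumption the ratio lies between roughly 1.7 and 2.6 over the quoted ranges, consistent with an effective doubling of the sampled area.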

Another major consideration in an analysis such as this is the 
stability of the morphological classification with redshift - as more 
distant galaxies will tend to be fainter and less extended on the sky. 
In general the robustness of a morphological classification is diffi- 
cult to assess because of its subjectivity. It has been found in pre- 
vious work (Nairn et al. 1995) that this subjective element results 
in an uncertainty of the order of 2 T-Types. However, this figure
does not incorporate the uncertainties introduced to the
classification through inclination, obscuration and other systematic
effects. All of these uncertainties add to the importance of being
able to estimate morphologies in a more robust manner, such as by
correlating with galaxy spectra, or indeed of neglecting morphology
altogether and simply using a spectral-based classification (see
e.g. Madgwick & Lahav 2001).

Because the galaxy sample considered here has been restricted
to apparent magnitudes brighter than b_J = 16.5, misclassification is
not considered to be as significant an issue in this analysis as it
might otherwise be. However, when repeating the above analysis
using galaxies fainter than this magnitude limit a very substantial



Table 3. Success rates of the 9:9:2 ANN for different redshift slices in the
testing data set. The numbers given are percentages of correctly classified
galaxies of each specified morphological type.

Redshift   N_tot   Early   Late
z < 0.05   1533    72%     82%
z > 0.05   761     67%     82%



systematic misclassification of spirals was observed. This was to be 
expected since the spiral arms of such galaxies will become more 
difficult to resolve at higher redshift (where most of the faintest 
galaxies will reside), particularly for galaxies inclined to the line- 
of-sight. 



7 DISCUSSION 

Perhaps one of the most interesting aspects of this work on
recovering galaxy morphologies from their spectra is how closely
related the results from advanced statistical methods appear to be to
the original 2dFGRS spectral classification, η. In some regards this
was to be expected, since one is always inclined to relate one's
classification to galaxy morphology during its derivation. For
example, Folkes et al. (1999) used a training set of 26 galaxies drawn
from the Kennicutt Atlas (Kennicutt 1992) to derive the original
2dFGRS spectral classification. These 26 galaxies were 'projected'
onto the (pc1, pc2) plane defined by the 2dFGRS spectra and lines
were drawn by hand to roughly separate the galaxies according to
their assumed morphologies. However, in the case of η this method
was not used; rather, the galaxies were classified solely on the
basis of finding the most statistically significant projection in
the PCA which was robust to the known instrumental uncertainties
in the 2dF instrument (see Madgwick et al. 2002 for more details).

The overall success of correlating galaxy morphologies and
spectra, using the methods considered in this paper, is summarised
in Fig. 11, where the percentages of galaxies classified correctly
are shown for each method. In the case of the η spectral
classification and the Fisher discriminant it is possible to change the
relative success rates between types simply by adopting different
'cuts' in these continuous parameters. This is demonstrated in
Fig. 11, where the success rates are shown for both an η < -1.4 cut
(2dFGRS default) and an η < -2 cut.
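
The trade-off introduced by moving the cut can be sketched as below. The η values and morphological labels are invented for illustration and the function name `success_rates` is hypothetical; only the two cut values and the convention that galaxies below the cut are called Early type follow the text.

```python
def success_rates(eta_values, true_types, cut):
    """Percentage of each true type recovered by a single cut in eta.

    Galaxies with eta < cut are called Early, all others Late.
    """
    ok = {"Early": 0, "Late": 0}
    total = {"Early": 0, "Late": 0}
    for eta, true_type in zip(eta_values, true_types):
        predicted = "Early" if eta < cut else "Late"
        total[true_type] += 1
        ok[true_type] += predicted == true_type
    return {t: 100.0 * ok[t] / total[t] for t in total if total[t]}


# Invented eta values: five Early types followed by five Late types.
eta = [-3.1, -2.4, -1.8, -1.6, -1.0, -1.7, -0.5, 0.8, 2.3, 4.0]
types = ["Early"] * 5 + ["Late"] * 5

for cut in (-1.4, -2.0):
    print(cut, success_rates(eta, types, cut))
# -1.4 {'Early': 80.0, 'Late': 80.0}
# -2.0 {'Early': 40.0, 'Late': 100.0}
```

In this toy sample the stricter cut removes the Late type interloper from the Early sample, but at the cost of losing half of the genuine Early types.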

In general, the most successful method of correlating galaxy 
morphology with spectra appears to be the Artificial Neural Net- 
work, which can achieve consistently high success rates for both 
types of galaxies. However, in deciding whether to implement such 
an algorithm the relative pay-off must be weighed against the ad- 
ditional complexity of this algorithm. This is particularly true for 
this data-set since all three methods of classification make roughly 
similar distinctions between morphological types. However, with 
the advent of higher resolution (and signal-to-noise) spectra, it is 
possible that such advanced methods will become much more suc- 
cessful. 

Note that in practical situations the interpretation of Fig. 11 is
not as straightforward as it might at first seem. One must also bear
in mind the relative fractions of different galaxy types in the sam- 
ple under consideration. In the case of the 2dFGRS, there are over 
2 times as many Late type galaxies as Early types (2610 and 1289 
galaxies respectively), so a decrease in the success rate of classify- 
ing Late types by 5% implies there will be an additional 250 galax- 
ies classified as Early type, which is at least a 10% contamination of 
the Early type sample. This is particularly important if one wants to 
create a relatively 'pure' sample of a particular type of galaxy (e.g. 
for efficient use of telescope time during observational follow-ups). 
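
The dependence of contamination on the morphological mix can be made explicit with a short calculation. The sketch assumes that every galaxy is assigned to one of the two classes, and uses the galaxy counts quoted above together with success rates of roughly 70% (Early) and 80% (Late), of the order of those in Table 2. The function name and the 'reversed mix' case are illustrative assumptions, not values measured for any near-infrared sample.

```python
def early_sample_contamination(n_early, n_late, early_rate, late_rate):
    """Fraction of the selected Early sample that is really Late type.

    early_rate, late_rate: fractions of each true type that are
    classified correctly (e.g. 0.70 and 0.80).
    """
    true_early = early_rate * n_early           # Early types recovered
    false_early = (1.0 - late_rate) * n_late    # Late types called Early
    return false_early / (true_early + false_early)


# Counts quoted in the text for the 2dFGRS sample.
c_2dF = early_sample_contamination(1289, 2610, 0.70, 0.80)
print(round(100 * c_2dF))  # 37

# Hypothetical sample with the proportions of the two types swapped.
c_swapped = early_sample_contamination(2610, 1289, 0.70, 0.80)
print(round(100 * c_swapped))  # 12
```

With identical success rates, simply swapping the proportions of the two types reduces the contamination of the Early sample by about a factor of three.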



8 CONCLUSIONS 

Establishing a firm link between a galaxy's morphology and its 
spectrum is advantageous for several reasons. For instance, galaxy 
spectra can be accurately determined to much greater redshifts and 
for fainter objects than morphologies. Also, most large redshift sur- 
veys currently taking place will contain many thousands of galaxy 
spectra but little information relating to the optical morphologies of 
those galaxies. In particular the separation of different morpholog- 
ical types of galaxies in these redshift surveys will be very useful 
as a means of separating objects for follow-up observations to
determine independent distance measurements using either D_n-σ
or the Tully-Fisher relation.

In this paper I have tried to quantify the link between galaxy
spectra and morphology using several advanced statistical methods;
namely, Fisher's linear discriminant and Artificial Neural Networks.
The best results produced suggest that it is possible to use
optical galaxy spectra to create galaxy samples containing 70% of
the Early type galaxies present and 80% of the Late types. The
contamination between these samples depends on the morphological
mix of the survey under consideration. In the case of the
b_J-selected 2dFGRS the most significant contamination will be of
misclassified Late types in the Early type sample (~40%
contamination); in the case of a near-infrared selected sample this
situation will be reversed.

Essentially the results obtained using more advanced statis- 
tical techniques (Sections 4 and 5) are comparable to those that 

Figure 11. Comparison between the success rates of different
classification methods. The first two sets of histograms show the success
rates that can be achieved simply by using the (PCA based) η spectral
classification adopted in the 2dFGRS. The second two histograms show the
success rates for the more advanced statistical methods: Fisher's linear
discriminant and the ANN. It can be seen that the results are generally
comparable, although the ANN gives the best results.



could be obtained simply using the default 2dFGRS spectral
classification η (Madgwick et al. 2002), which can be accessed from the
2dFGRS database. This is an interesting result and certainly adds
significantly to the physical interpretation of this parameter.

Another interesting aspect of this analysis is that the Fisher 
discriminant (Section 4) identified the 4000 Å break to be the most
essential element of a galaxy's spectrum for the purposes of
estimating its morphology. This result is somewhat expected since the
general correlation between galaxy morphology and colour is al- 
ready well established. However, it is intriguing to see this result 
derived in a quantitative manner from the observed spectra them- 
selves. 

The results presented in this paper are essentially limited by 
the coarseness of the morphological classification adopted, which 
for practical reasons can only be divided into two separate types 
(rather than a more realistic sequence of types). As larger sam- 
ples of more accurately morphologically classified galaxies become 
available it will be interesting to repeat the analysis presented here, 
in order to determine whether even more information can be recov- 
ered to link a galaxy's morphology and spectrum. 